Picture for Jingye Chen

Jingye Chen

MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale

Add code
May 26, 2026
Viaarxiv icon

Does Synthetic Layered Design Data Benefit Layered Design Decomposition?

Add code
May 14, 2026
Viaarxiv icon

Advancing Open-source World Models

Add code
Jan 28, 2026
Viaarxiv icon

Geometric-Mean Policy Optimization

Add code
Jul 28, 2025
Viaarxiv icon

Model as a Game: On Numerical and Spatial Consistency for Generative Games

Add code
Mar 27, 2025
Viaarxiv icon

AvatarArtist: Open-Domain 4D Avatarization

Add code
Mar 26, 2025
Viaarxiv icon

Large Motion Video Autoencoding with Cross-modal Video VAE

Add code
Dec 23, 2024
Figure 1 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 2 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 3 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 4 for Large Motion Video Autoencoding with Cross-modal Video VAE
Viaarxiv icon

TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization

Add code
Aug 07, 2024
Figure 1 for TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
Figure 2 for TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
Figure 3 for TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
Figure 4 for TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
Viaarxiv icon

LLMs Meet Multimodal Generation and Editing: A Survey

Add code
May 29, 2024
Figure 1 for LLMs Meet Multimodal Generation and Editing: A Survey
Figure 2 for LLMs Meet Multimodal Generation and Editing: A Survey
Figure 3 for LLMs Meet Multimodal Generation and Editing: A Survey
Figure 4 for LLMs Meet Multimodal Generation and Editing: A Survey
Viaarxiv icon

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

Add code
Nov 28, 2023
Figure 1 for TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Figure 2 for TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Figure 3 for TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Figure 4 for TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Viaarxiv icon